KMID : 1022420220140040035
|
|
Phonetics and Speech Sciences 2022 Volume.14 No. 4 p.35 ~ p.43
|
|
Performance comparison on vocal cords disordered voice discrimination via machine learning methods
|
|
Jo Cheol-Woo
Wang Soo-Geun Kwon Ick-Hwan
|
|
Abstract
|
|
|
This paper studies how to improve the identification rate of laryngeal disability speech data by convolutional neural network (CNN) and machine learning ensemble learning methods. In general, the number of laryngeal dysfunction speech data is small, so even if identifiers are constructed by statistical methods, the phenomenon caused by overfitting depending on the training method can lead to a decrease the identification rate when exposed to external data. In this work, we try to combine results derived from CNN models and machine learning models with various accuracy in a multi-voting manner to ensure improved classification efficiency compared to the original trained models. The Pusan National University Hospital (PNUH) dataset was used to train and validate algorithms. The dataset contains normal voice and voice data of benign and malignant tumors. In the experiment, an attempt was made to distinguish between normal and benign tumors and malignant tumors. As a result of the experiment, the random forest method was found to be the best ensemble method and showed an identification rate of 85%.
|
|
KEYWORD
|
|
diagnosis, glottic cancer, vocal cords disorder, machine learning, convolutional neural network
|
|
FullTexts / Linksout information
|
|
|
|
Listed journal information
|
|
|